cDNA CLONE FOR SOUTH AFRICAN ARBOVIRUS NO. 86

Field of the Invention 

The present invention relates to live attenuated vaccines in general, 
and particularly relates to attenuated vaccines produced from South African 
Arbovirus No. 86 (S.A.AR86) virus. 

Background of the Invention

This invention was made with government support under Grant No. 
2-ROIA122186 (09-13) awarded by the National Institutes of Health. The 
government has certain rights in the invention. 

South African Arbovirus No. 86 (S.A.AR86) is an isolate of Sindbis virus.
S.A.AR86 was originally isolated from mosquitoes. The virus is antigenically
related to Sindbis virus and to two other Sindbis virus isolates, Girdwood S.A.
and Ockelbo82. See Malherbe et al., South African Medical Journal 37:547 (1963)
and Niklasson et al., Am. J. Trop. Med. Hyg. 33:1212 (1984), respectively. The
latter is associated with a human disease, also known as Ockelbo. Sindbis virus
is the prototype member of the alphavirus genus of the family Togaviridae.
Sindbis virus includes various strains, including S.A.AR86 and Sindbis AR339.
The genome of Sindbis viruses consists of a single strand of RNA which contains
the information for the viral genes, and which is infectious when introduced
into the cytoplasm of cells.

Full-length cDNA clones of positive-strand RNA viruses are important tools for
the study of the biology of viruses, including Sindbis viruses. It is known with
respect to several viral systems that in vitro transcripts of cDNA clones, and
in some cases the cDNA itself, can initiate a complete and productive infectious
cycle upon introduction into susceptible cells. See Racaniello et al., Science
214:916 (1981); Ahlquist et al., Proc. Natl. Acad. Sci. USA 81:7066 (1984);
Kaplan et al., Proc. Natl. Acad. Sci. USA 82:8424 (1985); Mizutani et al.,
J. Virol. 56:628 (1985); van der Werf, Proc. Natl. Acad. Sci. USA 83:2330
(1986); Rice et al., J. Virol. 61:3809 (1987); and Vos et al., Virology 165:33
(1988). This has made it possible to test progeny virus for phenotypic
manifestations of directed mutations and recombinations which have been
introduced into the cDNA clone. Pathogenesis studies with several
positive-strand viruses, including the picornaviruses and the alphaviruses, have
been advanced significantly by the use of full-length cDNA clones.
As another useful application, live attenuated viral vaccines may be produced
using full-length cDNA clones. Live attenuated viral vaccines are among the most
successful means of controlling viral disease. However, for some virus
pathogens, immunization with a live virus strain may be either impractical or
unsafe. Sindbis virus is subclinical in humans, but is closely related to other
viruses which do induce clinical diseases in humans, such as Ockelbo, an
epidemic polyarthritis syndrome common in areas of Scandinavia and Northern
Europe. Accordingly, Sindbis virus vaccines are desirable for producing an
immunogenic response to such clinical diseases. Sindbis virus vaccines are also
desirable as viral carriers in virus constructs which express genes encoding
immunizing antigens for other viruses. See U.S. Patent No. 5,217,879 to Huang
et al. Huang et al. describes Sindbis infectious viral vectors. However, the
reference does not describe the cDNA sequence of S.A.AR86 virus, or clones or
viral vectors produced therefrom.

Accordingly, there remains a need in the art for full-length cDNA clones of
positive-strand RNA viruses, such as the S.A.AR86 strain of Sindbis. In
addition, there is a need in the art for full-length cDNA clones of S.A.AR86
encoding infectious RNA transcripts. Further, there remains a need in the art
for cDNA clones of S.A.AR86 which encode RNA transcripts which may be used to
produce infectious attenuated viral particles, and methods of producing such
viral particles.

Summary of the Invention 

As a first aspect, the present invention provides a recombinant DNA comprising
a cDNA coding for an infectious South African Arbovirus No. 86 (S.A.AR86) virus
RNA transcript and a heterologous promoter positioned upstream from the cDNA and
operatively associated therewith. The cDNA is selected from the group consisting
of (i) cDNA having the sequence given herein as SEQ ID NO.: 1, (ii) cDNA having
the same RNA coding sequence as the cDNA given herein as SEQ ID NO.: 1, and
(iii) cDNA according to (i) or (ii) above and further containing at least one
attenuating mutation. Preferably at least one attenuating mutation is included
in the cDNA, and more preferably at least two attenuating mutations are included
in the cDNA. Attenuating mutations may, for example, be provided in any of the
nsP1, E2, and nsP2 coding regions of the cDNA. Preferably at least one silent
mutation is included in the cDNA in addition to the attenuating mutation(s).

As a second aspect, the present invention provides an infectious RNA transcript
encoded by the cDNA.

As a third aspect, the present invention provides infectious 
attenuated viral particles containing the RNA transcript encoded by the cDNA. 

The foregoing and other aspects of the present invention are 
explained in detail in the detailed description set forth below. 

Brief Description of the Drawings

Figure 1 shows the relationship of the 3' half of the nsP3 gene among various
Sindbis-like isolates.

Figure 2 shows the replacement of the AR339-derived cDNAs into the plasmid
pToto1101 background.

Detailed Description of the Invention

The South African Arbovirus No. 86 (S.A.AR86) viral clones employed in
practicing the present invention are genomic clones which code for an RNA
transcript, which RNA transcript is capable of producing live encapsidated
S.A.AR86 virus when used to transfect an S.A.AR86 virus-permissive cell.

S.A.AR86 virus-permissive cells are cells which, upon transfection with the
viral RNA transcript, are capable of producing viral particles. The S.A.AR86
virus has a broad host cell range. Examples of suitable host cells include, but
are not limited to, Vero cells, baby hamster kidney (BHK-21) cells, and chicken
embryo fibroblast cells. Uptake of the RNA into the cells can be achieved by any
suitable means, such as, for example, by treating the cells with DEAE-dextran,
treating the cells with "LIPOFECTIN™", and by electroporation, with
electroporation being the currently preferred means of achieving RNA uptake into
the host cells.

The phrases "attenuating mutation" and "attenuating amino acid," as used herein,
mean a nucleotide mutation or an amino acid coded for in view of such a mutation
which results in a decreased probability of causing disease in its host (i.e., a
loss of virulence), in accordance with standard terminology in the art, whether
the mutation be a substitution mutation or an in-frame deletion mutation. See,
e.g., B. Davis et al., Microbiology 132 (3d ed. 1980). The phrase "attenuating
mutation" excludes mutations which would be lethal to the virus.

The phrase "silent mutation" as used herein refers to mutations in 
the cDNA coding sequence which do not produce mutations in the corresponding 
protein sequence translated therefrom. 
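
By way of illustration only, and not as part of the disclosed methods, the
definition of a silent mutation can be sketched in Python. The helper names
`translate_codon` and `is_silent` are hypothetical. The codon GGC used for the
C-to-G substitution at nucleotide 3863 is read from the sequence listing as
printed and assumes the reading frame that begins at the ATG at nucleotides
60-62; both points are assumptions of this sketch rather than statements of the
specification.

```python
# Standard genetic code, built from the conventional T, C, A, G ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_codon(codon: str) -> str:
    """Return the one-letter amino acid ('*' for stop) for a DNA codon."""
    return CODON_TABLE[codon.upper()]


def is_silent(codon: str, position: int, new_base: str) -> bool:
    """True if replacing the base at 0-based `position` keeps the amino acid."""
    mutated = codon[:position] + new_base + codon[position + 1:]
    return translate_codon(codon) == translate_codon(mutated)


# nt 3863 C-to-G: assumed codon context GGC -> GGG (both encode glycine).
print(is_silent("GGC", 2, "G"))
# Hypothetical contrast: a second-position change GGC -> GAC (Gly -> Asp).
print(is_silent("GGC", 1, "A"))
```

Under these assumptions the first call reports a silent change and the second
does not, which is the distinction the definition draws.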

The cDNA clone has a sequence as given herein as SEQ ID NO.: 1. Alternatively,
the cDNA clone may have a sequence which differs from the cDNA of SEQ ID NO.: 1,
but which has the same RNA coding sequence as the cDNA given herein as SEQ ID
NO.: 1. Thus, the cDNA clone may include one or more silent mutations. For
example, the clone sequence may differ from the wild-type S.A.AR86 sequence
given herein as SEQ ID NO.: 1 by the inclusion of silent mutations at any or all
of nucleotides 215, 3863, 5984, and 9113. The silent mutations at the foregoing
nucleotides may be substitution or in-frame deletion mutations, such as, for
example, the substitution of guanine for adenine at nucleotide 215 of the cDNA
sequence given herein as SEQ ID NO.: 1; or the substitution of guanine for
cytosine at nucleotide 3863 of the cDNA sequence given herein as SEQ ID NO.: 1;
or the substitution of guanine for adenine at nucleotide 5984 of the cDNA
sequence given herein as SEQ ID NO.: 1; or the substitution of cytosine for
thymine at nucleotide 9113 of the cDNA sequence given herein as SEQ ID NO.: 1.
In yet another embodiment, the cDNA clone has a sequence according to either of
the foregoing described sequences, but which also includes attenuating
mutations, the attenuating mutations being described more fully hereinafter.

Promoter sequences and S.A.AR86 virus cDNA clones are operatively associated in
the present invention such that the promoter causes the cDNA clone to be
transcribed in the presence of an RNA polymerase which binds to the promoter.
The promoter is positioned on the 5' end (with respect to the virion RNA
sequence), or "upstream" from, the cDNA clone. An excessive number of
nucleotides between the promoter sequence and the cDNA clone will result in the
inoperability of the construct. Hence, the number of nucleotides between the
promoter sequence and the cDNA clone is preferably not more than eight, more
preferably not more than five, still more preferably not more than three, and
most preferably not more than one. Examples of promoters which are useful in the
cDNA sequences of the present invention include, but are not limited to, T3
promoters, T7 promoters, and SP6 promoters. The DNA sequence of the present
invention may reside in any suitable transcription vector. The DNA sequence
preferably has a complementary DNA sequence bonded thereto so that the
double-stranded sequence will serve as an active template for RNA polymerase.
The transcription vector preferably comprises a plasmid. When the DNA sequence
comprises a plasmid, it is preferred that a unique restriction site be provided
3' (with respect to the virion RNA sequence) to (i.e., "downstream" from) the
cDNA clone. This provides a means for linearizing the DNA sequence to allow the
transcription of genome-length RNA in vitro.
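
By way of illustration only, the spacing preferences recited above for the
nucleotides between the promoter and the cDNA clone can be expressed as a small
Python check. The function name `spacer_preference` and the tier labels are
hypothetical and are chosen here for readability; only the thresholds of eight,
five, three, and one nucleotides come from the text.

```python
def spacer_preference(n_nucleotides: int) -> str:
    """Classify a promoter-to-cDNA spacer length against the recited limits."""
    if n_nucleotides < 0:
        raise ValueError("spacer length cannot be negative")
    # Tightest limit first, so the strongest preference that applies wins.
    tiers = [
        (1, "most preferred"),
        (3, "still more preferred"),
        (5, "more preferred"),
        (8, "preferred"),
    ]
    for limit, label in tiers:
        if n_nucleotides <= limit:
            return label
    return "outside preferred range"


for length in (0, 2, 4, 8, 9):
    print(length, spacer_preference(length))
```

A spacer longer than eight nucleotides falls outside every recited preference,
which this sketch reports without asserting that such a construct is inoperable
in any particular case.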

The cDNA clone can be generated by any of a variety of suitable methods known to
those skilled in the art. A preferred method is the method set forth in U.S.
Patent No. 5,185,440 to Davis et al., and Gubler et al., Gene 25:263 (1983), the
disclosures of which are incorporated herein by reference in their entirety.
Attenuating mutations of S.A.AR86 are identified by sequencing attenuated
strains of the S.A.AR86 virus and comparing the sequence of the attenuated
strain with the sequence of the corresponding wild-type virus. Serial passage
techniques for the generation of attenuated strains may be carried out in
accordance with known procedures. Preferably, the attenuated strains are
generated by selecting strains at each passage during serial passage in cell
culture which either grow rapidly or penetrate the cell more rapidly. This
selection process, which reduces the number of serial passages required to
obtain attenuated strains, is known. See, e.g., Olmstead et al., Science 225:424
(1984); and Johnston et al., Virology 162:437 (1988), the disclosures of which
are incorporated herein by reference in their entirety. cDNA clones may be
modified to incorporate attenuating mutations by site-directed mutagenesis in
accordance with known procedures. An exemplary technique is that of Kunkel,
Proc. Natl. Acad. Sci. USA 82:488 (1985). These same techniques may be used to
join the heterologous promoter to the cDNA clone.
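
The comparison of an attenuated-strain sequence with the wild-type sequence can
be sketched as follows. This is a minimal illustration under stated assumptions:
the two sequences are taken to be already aligned and of equal length (so
insertions and deletions are not handled), the function name
`find_substitutions` is hypothetical, and the ten-base sequences are invented
toy data, not S.A.AR86 sequence.

```python
from typing import List, Tuple


def find_substitutions(
    wild_type: str, variant: str, first_position: int = 1
) -> List[Tuple[int, str, str]]:
    """List (position, wild-type base, variant base) for each mismatch."""
    if len(wild_type) != len(variant):
        raise ValueError("sequences must be pre-aligned to equal length")
    pairs = zip(wild_type.upper(), variant.upper())
    return [
        (first_position + i, w, v)
        for i, (w, v) in enumerate(pairs)
        if w != v
    ]


# Toy sequences invented for illustration only.
wild_type = "ATGGCCAAGT"
attenuated = "ATGGCTAAGA"
for position, wt_base, new_base in find_substitutions(wild_type, attenuated):
    print(f"nt {position}: {wt_base}-{new_base}")
```

Each reported difference is only a candidate; whether it is attenuating would
still have to be established experimentally, as described in the examples.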

RNA is preferably synthesized from the DNA sequence in vitro using purified RNA
polymerase in the presence of ribonucleotide triphosphates in accordance with
conventional techniques.

Pharmaceutical compositions, such as vaccines, containing the S.A.AR86 clone of
the present invention comprise an immunogenic amount of a live attenuated virus
as disclosed herein in combination with a pharmaceutically acceptable carrier.
An "effective immunogenic amount" is an amount of the attenuated virus
sufficient to evoke an immune response in the subject to which the vaccine is
administered. An amount of about 10^1 to about 10^5 plaque forming units per
dose is believed to be suitable, depending upon the age and species of the
subject being treated. Examples of suitable pharmaceutically acceptable carriers
include, but are not limited to, sterile pyrogen-free water and sterile
pyrogen-free physiological saline solution. Subjects which may be administered
immunogenic amounts of the live attenuated viruses of the present invention
include both human and animal (e.g., horse, donkey, mouse, hamster, or monkey)
subjects. Administration may be by a suitable means, such as intraperitoneal,
intracerebral or intramuscular injection.

Complementary DNA clones of the S.A.AR86 virus are made in accordance with the
procedures described herein, as supplemented with procedures known in the art.
We employed the S.A.AR86 virus as a starting material.

A first exemplary attenuating substitution mutation in an S.A.AR86 clone useful
in practicing the present invention is a substitution mutation which codes for
an attenuating amino acid, preferably isoleucine, at nsP1 amino acid residue
538.

A second exemplary attenuating substitution mutation in an S.A.AR86 clone useful
in practicing the present invention is a substitution mutation which codes for
an attenuating amino acid, preferably threonine, at E2 amino acid residue 304.

A third exemplary attenuating substitution mutation in an S.A.AR86 clone useful
in practicing the present invention is a substitution mutation which codes for
an attenuating amino acid, preferably lysine, at E2 amino acid residue 314.

A fourth exemplary attenuating substitution mutation in an S.A.AR86 clone useful
in practicing the present invention is a substitution mutation which codes for
an attenuating amino acid, preferably valine, at E2 amino acid residue 372.

A fifth exemplary attenuating substitution mutation in an S.A.AR86 clone useful
in practicing the present invention is a substitution mutation which codes for
an attenuating amino acid, preferably alanine, at E2 amino acid residue 376.

One embodiment of the present invention contains, in combination, said
attenuating substitution mutations at E2 amino acid residues 304, 314, 372, and
376.

A sixth exemplary attenuating substitution mutation in an S.A.AR86 clone useful
in practicing the present invention is a substitution mutation which codes for
an attenuating amino acid, preferably glycine, at nsP2 amino acid residue 96.

A seventh exemplary attenuating substitution mutation in an S.A.AR86 clone
useful in practicing the present invention is a substitution mutation which
codes for an attenuating amino acid, preferably valine, at nsP2 amino acid
residue 372.

One embodiment of the present invention contains, in combination, said
attenuating substitution mutations at nsP2 amino acid residues 96 and 372.

An eighth exemplary attenuating substitution mutation in an S.A.AR86 clone
useful in practicing the present invention is a substitution mutation which
codes for an attenuating amino acid, preferably leucine, at nsP2 amino acid
residue 529.

A ninth exemplary attenuating substitution mutation in an S.A.AR86 clone useful
in practicing the present invention is a substitution mutation which codes for
an attenuating amino acid, preferably asparagine, at nsP2 amino acid residue
571.

A tenth exemplary attenuating substitution mutation in an S.A.AR86 clone useful
in practicing the present invention is a substitution mutation which codes for
an attenuating amino acid, preferably arginine, at nsP2 amino acid residue 682.

An eleventh exemplary attenuating substitution mutation in an S.A.AR86 clone
useful in practicing the present invention is a substitution mutation which
codes for an attenuating amino acid, preferably arginine, at nsP2 amino acid
residue 804.

A twelfth exemplary attenuating substitution mutation in an S.A.AR86 clone
useful in practicing the present invention is a substitution mutation which
codes for an attenuating amino acid, preferably arginine, at nsP3 amino acid
residue 22.

One embodiment of the present invention contains, in combination, said
attenuating substitution mutations at nsP2 amino acid residues 529, 571, 682,
and 804, and at nsP3 amino acid residue 22.

The cDNA clones according to the present invention are useful for the
preparation of pharmaceutical formulations, such as vaccines, as described
above. In addition, the cDNA clones of the present invention are useful for
administration to animals for the purpose of producing antibodies to the
S.A.AR86 virus, which antibodies may be collected and used in known diagnostic
techniques for the detection of S.A.AR86 virus.

The following examples are provided to illustrate the present invention, and
should not be construed as limiting thereof. In these examples, nt means
nucleotide.

EXAMPLE 1

Relationship of S.A.AR86 Clone to Other Sindbis Strains 

At the nucleotide level, S.A.AR86 differs from the consensus sequence of Sindbis
strain AR339 by 685 nucleotides (111 amino acids). From the published Sindbis HR
sequence (see Strauss et al., Virol. 133:92 (1984)), S.A.AR86 differs by 704
nucleotides and 119 amino acids. S.A.AR86 differs from the sequence of Ockelbo82
(see Shirako et al., Virol. 182:753 (1991)) by 430 nucleotides (67 amino acids).
Included in these differences are several insertions and deletions present in
S.A.AR86 relative to the Sindbis sequences. The relationship of the 3' half of
the nsP3 gene among various Sindbis-like isolates is shown in Figure 1.
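
For orientation only, the nucleotide differences stated above can be put on a
common scale by dividing by the 11663-nucleotide length given for SEQ ID NO: 1
in the sequence listing. This is a rough sketch, not a measurement reported in
this specification: because the differences include insertions and deletions,
and the compared genomes differ in length, the percentages are approximate. The
function name `percent_divergence` is hypothetical.

```python
GENOME_LENGTH = 11663  # length of SEQ ID NO: 1 as stated in the sequence listing

# Nucleotide differences from S.A.AR86, as stated in Example 1.
nucleotide_differences = {
    "Sindbis AR339 consensus": 685,
    "Sindbis HR": 704,
    "Ockelbo82": 430,
}


def percent_divergence(differences: int, length: int = GENOME_LENGTH) -> float:
    """Differences as a percentage of genome length, rounded to 2 decimals."""
    return round(100.0 * differences / length, 2)


for strain, count in nucleotide_differences.items():
    print(f"{strain}: about {percent_divergence(count)}% of {GENOME_LENGTH} nt")
```

On this rough scale S.A.AR86 is closer to Ockelbo82 than to either Sindbis
AR339 or Sindbis HR, consistent with the raw counts.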

EXAMPLE 2 
Observed Mortality in Mice Infected with S.A.AR86 
When adult mice are inoculated intracerebrally (i.c.) with low doses of
S.A.AR86, 100% mortality is observed. The methods employed for evaluating the
incidence of mortality in mice are set forth in D. Russell et al., J. Virol.
63(4):1619 (1989), the disclosure of which is incorporated herein by reference
in its entirety. This high mortality rate is unique among "Sindbis-like"
viruses, which are characteristically avirulent in adult mice. Also unique in
the S.A.AR86 strain is the cys substitution for the opal stop codon normally
found between the nsP3 and nsP4 genes.
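
As a brief illustration of the opal-to-cysteine relationship, the sketch below
enumerates the single-base substitutions of the opal codon (UGA, written TGA in
cDNA) that yield a cysteine codon (TGT or TGC). The specification does not state
which cysteine codon S.A.AR86 carries at this position, so the sketch lists the
possibilities without selecting one. The function name `single_base_neighbors`
is hypothetical.

```python
BASES = "TCAG"
OPAL = "TGA"                 # opal stop codon in cDNA notation
CYS_CODONS = {"TGT", "TGC"}  # the two cysteine codons of the standard code


def single_base_neighbors(codon):
    """Yield (1-based position, mutant codon) for every single-base change."""
    for pos in range(3):
        for base in BASES:
            if base != codon[pos]:
                yield pos + 1, codon[:pos] + base + codon[pos + 1:]


cys_changes = [
    (pos, mutant)
    for pos, mutant in single_base_neighbors(OPAL)
    if mutant in CYS_CODONS
]
print(cys_changes)
```

Both routes change only the third base of the codon, so a single nucleotide
difference is sufficient to replace the stop signal with cysteine.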

EXAMPLE 3 
Construction of S.A.AR86 Clone 

The S.A.AR86 clone is constructed by substituting partial cDNA clones of
S.A.AR86 genomic RNA into pTR5000, one of a series of Sindbis AR339 cDNA clones.
Construction of pTR5000 (a full-length cDNA clone of Sindbis following the SP6
phage promoter and containing mostly Sindbis AR339 sequences) is accomplished by
sequential replacement of AR339-derived cDNAs into the plasmid pToto1101
background, according to the technique described in Rice et al., J. Virol.
61:3809 (1987). The replacement of the AR339-derived cDNAs into the plasmid
pToto1101 background is shown in Figure 2.

WO 96/37220    PCT/US96/07457

Production of the cDNAs used in constructing pTR5000 has been described
previously in Polo et al., J. Virol. 62:2124 (1988), as has the construction of
pTR2000; see Polo et al., J. Virol. 62:2124 (1988) and Polo et al., J. Virol.
64:4438 (1990). Nucleotide numbering follows that of Strauss et al., Virol.
133:92 (1984).

The StuI (nt8571) to SacII (nt11484) fragment of pToto1101 is removed and
replaced with the analogous fragment from clone pSB4 to form pTR2000, using the
loss of the pToto1101 StuI site at nt10770 (a site not present in AR339) as a
screen, as shown in Figure 2. The sequences of AR339 and pToto1101 are identical
from nt11485 to the 3'-end (nt11703). Therefore, as shown in Figure 2, these 3'
sequences are of AR339 origin. Construction of pTR3000 is accomplished by
replacement of the BssHII (nt9804) to StuI (nt8571) fragment of pSB3 into
pTR2000 from which the analogous fragment had been removed. The AflII site found
at nt8835 in pToto1101 but absent in AR339 is used to screen the recombinants.
An AR339 fragment from pSB1, SpeI (nt5262) to BssHII, is used to replace the
SpeI-BssHII fragment from pTR2000, using the AflII screen and forming pTR4000.
To construct pTR5000, pSB5 is subcloned into pUC119, and the PstI site at nt3953
is ablated using site-directed mutagenesis, as described in Kunkel, Methods
Enzymol. 154:367 (1987), to change nt3950 from U to C. The ClaI (nt2713) to SpeI
fragment is removed from the mutagenized subclone and, to form pTR5000, is used
to replace the analogous fragment of pTR4000 using the ablated PstI site as a
screen.
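
The logic of swapping an analogous fragment between two clones can be sketched
in Python as a coordinate-based substitution. This is a simplified illustration
only: real cloning cuts at restriction sites and depends on compatible ends,
whereas the sketch assumes that backbone and donor are the same length and share
coordinates. The function name `replace_fragment` and the toy sequences are
hypothetical.

```python
def replace_fragment(backbone: str, donor: str, start: int, end: int) -> str:
    """Swap backbone[start..end] (1-based, inclusive) for the donor's span."""
    if len(backbone) != len(donor):
        raise ValueError("sketch assumes equal-length, co-numbered sequences")
    if not 1 <= start <= end <= len(backbone):
        raise ValueError("fragment coordinates out of range")
    return backbone[:start - 1] + donor[start - 1:end] + backbone[end:]


# Toy 20-base sequences; lowercase marks backbone, uppercase marks donor.
backbone = "a" * 20
donor = "B" * 20
chimera = replace_fragment(backbone, donor, start=8, end=12)
print(chimera)
```

Marking the two origins by letter case makes the chimeric structure visible at a
glance, which is the role Figure 2 plays for the actual constructs.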

Partial cDNA clones of S.A.AR86 were obtained using classical reverse
transcriptase (RT) procedures according to Polo et al., J. Virol. 62:2124
(1988), as well as RT-PCR protocols according to Heidner et al., J. Virol.
68:2683 (1994). These cDNA clones are used to replace analogous portions of the
clone pTR5000, culminating in the construction of a full-length cDNA of the
S.A.AR86 genomic sequence downstream of an SP6 promoter and followed by a
poly(A) tract and a unique XbaI site. During the course of replacing S.A.AR86
sequences into pTR5000, it was observed that the pTR5000 nonstructural proteins
are incompatible with those of S.A.AR86, so that the chimeric clones yielded
transcripts which are not infectious for baby hamster kidney (BHK-21) cells. In
addition, the restriction sites in the nonstructural region of S.A.AR86 are very
different from the Sindbis AR339-based clones. The first complete S.A.AR86
clone, pS10, also failed to yield infectious transcripts.

The construction is repeated, beginning with pTR5000, using sequences derived
from the same partial cDNA clones of S.A.AR86 as are used for the construction
of pS10 except for nucleotides 3171 to 6410 (numbering from nucleotide 1 of the
S.A.AR86 sequence), which are derived by RT-PCR of the genomic RNA. The
resulting construct, pS22, gives infectious transcripts, but the virus derived
therefrom is temperature-sensitive. Replacement of nucleotides 3171 to 6410 with
an alternative RT-PCR derived cDNA corrected the temperature-sensitive defect,
yielding clone pS24.

EXAMPLE 4 

Observed Mortality in Mice Infected with S.A.AR86

Upon i.c. inoculation, virus derived from pS24 is avirulent, whereas S.A.AR86
caused 100% mortality. Clearly, pS24 contained one or more mutations which were
strongly attenuating. The complete sequence of pS24 was determined directly from
pS24 and related clones. Comparison with the S.A.AR86 genomic RNA sequence
reveals 5 mutations or clusters of mutations which are potentially associated
with the avirulent phenotype of virus from pS24. These included the mutations or
clusters of mutations indicated in clones pS56, pS51, and pS57, a mutation at
nt1278 A-C (nsP1 407 K-Q), and a mutation at nt5972 T-G (nsP3 228 N-S). While
said nsP1 407 substitution alone is a lethal mutation, the viruses are viable
when the mutation co-exists with a serine at the nsP3 amino acid residue 228.
Clone pS24 is corrected by a combination of site-directed mutagenesis and
replacement of specific pS24 sequences with cDNA fragments which do not contain
the subject mutations. The resulting cDNA is pS55, which contains 5' and 3'
untranslated sequences identical to S.A.AR86 genomic RNA, no coding differences
with S.A.AR86 genomic RNA, and the 4 non-coding changes at nt 215 A-G, nt 3863
C-G, nt 5984 A-G, and nt 9113 T-C. Virus derived from pS55 is indistinguishable
from native S.A.AR86 in tests of virulence and growth in adult mice and by
histopathological analysis of tissues from infected animals. These results,
which are reported in Table 1 below, indicate that virus derived from clone pS55
accurately reflects native S.A.AR86 both in terms of coding sequence and in vivo
phenotype.
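
The correspondence between a nucleotide number and an nsP1 residue, such as
nt1278 A-C with nsP1 407 K-Q, can be checked with a short sketch. The sketch
rests on assumptions that are not stated in the text: that the nonstructural
reading frame opens at the ATG at nucleotides 60-62 of SEQ ID NO: 1, that the
initiating methionine is nsP1 residue 1, and that the two listing rows copied
below are free of extraction errors. The names `ORF_START`, `LISTING_ROWS`, and
`describe_substitution` are hypothetical.

```python
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}

ORF_START = 60  # assumption: first base of the ATG read from the listing

# Rows copied from the sequence listing, keyed by the number of their first base.
LISTING_ROWS = {
    1261: "AAGATCTTGA CAATGAAAAA ATGCTGGGCA CCAGAGAGCG CAAGCTTACA TATGGCTGCT",
    1621: "TCGAGGCAGC TGCGGAAGTT GTCTGCGAAG TGGAGGGGCT CCAGGCGGAC ACCGGAGCAG",
}
LISTING_ROWS = {k: v.replace(" ", "") for k, v in LISTING_ROWS.items()}


def describe_substitution(nt: int, new_base: str):
    """Return (residue number, wild-type aa, mutant aa) for a change at `nt`."""
    residue = (nt - ORF_START) // 3 + 1
    codon_start = ORF_START + 3 * (residue - 1)
    row_start = max(s for s in LISTING_ROWS if s <= codon_start)
    offset = codon_start - row_start
    codon = LISTING_ROWS[row_start][offset:offset + 3]
    if len(codon) < 3:
        raise ValueError("codon is not covered by the copied rows")
    pos = nt - codon_start
    mutant = codon[:pos] + new_base + codon[pos + 1:]
    return residue, CODON_TABLE[codon], CODON_TABLE[mutant]


print(describe_substitution(1278, "C"))  # nt1278 A-C
print(describe_substitution(1672, "T"))  # C-T change at nucleotide 1672
```

Under these assumptions the first call reproduces the lysine-to-glutamine change
at residue 407, and the second places the change at nucleotide 1672 in the codon
for residue 538, converting threonine to isoleucine.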



                                  Table 1
                             VIRULENCE IN MICE

AGE OF MICE      S.A.AR86       S55            S56²           S51³
4 to 6 Weeks
  Mortality      100%           100%           80%            20%
  AST¹           6.36 ± 1.39    6.37 ± 1.4     9.37 ± 1.77    8.5 ± 0.7

¹Average Survival Time in days.
²Virus isogenic with S55 except at nucleotide 6 (C-A).
³Virus isogenic with S55 except at nucleotide 1648 (C-T). This is nsP1 amino
acid 538 (T-I). The isoleucine is the amino acid found in all other Sindbis
isolates sequenced to date.

The 3 mutations or clusters of mutations in clones pS56, pS51, and pS57 are
placed independently into the pS55 background. Virus derived from each of these
clones is highly attenuated in adult mice inoculated i.c. In clone pS53, the
mutations at nucleotides 6 and 1672 are combined, and the resulting virus is
avirulent in the adult mouse model. The mutations in pS61 are present in pS48,
an intermediate clone constructed during the repair of pS24. The virus from pS48
produced small plaques on BHK-21 cells. When these mutations are placed in the
pS55 background, they also gave a highly attenuated phenotype (S61, virus
derived from pS61, gave 33.3% mortality (6 ± 4.2)).

The foregoing is illustrative of the present invention and is not to 
be construed as limiting thereof. The invention is defined by the following claims, 
with equivalents of the claims to be included therein.

SEQUENCE LISTING

(1) GENERAL INFORMATION:

     (i) APPLICANT: Johnston, Robert E.
                    Simpson, Dennis
                    Davis, Nancy L.

    (ii) TITLE OF INVENTION: cDNA Clone for South African Arbovirus No. 86

   (iii) NUMBER OF SEQUENCES: 1

    (iv) CORRESPONDENCE ADDRESS:
          (A) ADDRESSEE: Kenneth D. Sibley
          (B) STREET: Post Office Drawer 34009
          (C) CITY: Charlotte
          (D) STATE: NC
          (E) COUNTRY: USA
          (F) ZIP: 28234

     (v) COMPUTER READABLE FORM:
          (A) MEDIUM TYPE: Floppy disk
          (B) COMPUTER: IBM PC compatible
          (C) OPERATING SYSTEM: PC-DOS/MS-DOS
          (D) SOFTWARE: PatentIn Release #1.0, Version #1.30

    (vi) CURRENT APPLICATION DATA:
          (A) APPLICATION NUMBER:
          (B) FILING DATE:
          (C) CLASSIFICATION:

  (viii) ATTORNEY/AGENT INFORMATION:
          (A) NAME: Sibley, Kenneth D.
          (B) REGISTRATION NUMBER: 31,665
          (C) REFERENCE/DOCKET NUMBER: 5470-118

    (ix) TELECOMMUNICATION INFORMATION:
          (A) TELEPHONE: (919) 881-3140
          (B) TELEFAX: (919) 881-3175
          (C) TELEX: 575102

(2) INFORMATION FOR SEQ ID NO:1:

     (i) SEQUENCE CHARACTERISTICS:
          (A) LENGTH: 11663 base pairs
          (B) TYPE: nucleic acid
          (C) STRANDEDNESS: single
          (D) TOPOLOGY: linear

    (ii) MOLECULE TYPE: cDNA

   (iii) HYPOTHETICAL: NO

    (xi) SEQUENCE DESCRIPTION: SEQ ID NO:1:

ATTGGCGGCG TAGTACACAC TATTGAATCA AACAGCCGAC CAATTGCACT ACCATCACAA 60 

TGGAGAAGCC AGTAGTTAAC GTAGACGTAG ACCCTCAGAG TCCGTTTGTC GTGCAACTGC 120 

AAAAGAGCTT CCCGCAATTT GAGGTAGTAG CACAGCAGGT CACTCCAAAT GACCATGCTA 180

ATGCCAGAGC ATTTTCGCAT CTGGCCAGTA MCTAATCGA GCTGGAGGTT CCTACCACAG 240

CGACGATTTT GGACATAGGC AGCGCACCGG CTCGTAGAAT GTTTTCCGAG CACCAGTACC 300 

ATTGCGTTTG CCCCATGCGT AGTCCAGAAG ACCCGGACCG CATGATGAAA TATGCCAGCA 360 

AACTGGCGGA AAAAGCATGT AAGATTACAA ACAAGAACTT GCATGAGAAG ATCAAGGACC 420 

TCCGGACCGT ACTTGATACA CCGGATGCTG AAACGCCATC ACTCTGCTTC CACAACGATG 480 

TTACCTGCAA CACGCGTGCC GAGTACTCCG TCATGCAGGA CGTGTACATC AACGCTCCCG 540 

GAACTATTTA CCACCAGGCT ATGAAAGGCG TGCGGACCCT GTACTGGATT GGCTTCGACA 600 

CCACCCAGTT CATGTTCTCG GCTATGGCAG GTTCGTACCC TGCATACAAC ACCAACTGGG 660 

CCGACGAAAA AGTCCTTGAA GCGCGTAACA TCGGACTCTG CAGCACAAAG CTGAGTGAAG 720 

GCAGGACAGG AAAGTTGTCG ATAATGAGGA AGAAGGAGTT GAAGCCCGGG TCACGGGTTT 780 

ATTTCTCCGT TGGATCGACA CTTTACCCAG AACACAGAGC CAGCTTGCAG AGCTGGCATC 840 

TTCCATCGGT GTTCCACTTG AAAGGAAAGC AGTCGTACAC TTGCCGCTGT GATACAGTGG 900 

TGAGCTGCGA AGGCTACGTA GTGAAGAAAA TCACCATCAG TCCCGGGATC ACGGGAGAAA 960 

CCGTGGGATA CGCGGTTACA AACAATAGCG AGGGCTTCTT GCTATGCAAA GTTACCGATA 1020 

CAGTAAAAGG AGAACGGGTA TCGTTCCCCG TGTGCACGTA TATCCCGGCC ACCATATGCG 1080 

ATCAGATGAC CGGCATAATG GCCACGGATA TCTCACCTGA CGATGCACAA AAACTTCTGG 1140 

TTGGGCTCAA CCAGCGAATC GTCATTAACG GTAAGACTAA CAGGAACACC AATACCATGC 1200 

AAAATTACCT TCTGCCAATC ATTGCACAAG GGTTCAGCAA ATGGGCCAAG GAGCGCAAAG 1260 

AAGATCTTGA CAATGAAAAA ATGCTGGGCA CCAGAGAGCG CAAGCTTACA TATGGCTGCT 1320 

TGTGGGCGTT TCGCACTAAG AAAGTGCACT CGTTCTATCG CCCACCTGGA ACGCAGACCA 1380 

TCGTAAAAGT CCCAGCCTCT TTTAGCGCTT TCCCCATGTC ATCCGTATGG ACTACCTCTT 1440 

TGCCCATGTC GCTGAGGCAG AAGATGAAAT TGGCATTACA ACCAAAGAAG GAGGAAAMC 1500 

TGCTGCAAGT CCCGGAGGAA TTAGTTATGG AGGCCAAGGC TGCTTTCGAG GATGGTCAGG 1560 

AGGAATCCAG AGCGGAGAAG CTCCGAGAAG CACTCCCACC ATTAGTGGCA GACAAAGGTA 1620 

TCGAGGCAGC TGCGGAAGTT GTCTGCGAAG TGGAGGGGCT CCAGGCGGAC ACCGGAGCAG 1680 

CACTCGTCGA AACCCCGCGC GGTCATGTAA GGATAATACC TCAAGCAAAT GACCGTATGA 1740 

TCGGACAGTA TATCGTTGTC TCGCCGATCT CTGTGCTGAA GAACGCTAAA CTCGCACCAG 1800 

CACACCCGCT AGCAGACCAG GTTAAGATCA TAACGCACTC CGGAAGATCA GGAAGGTATG 1860 

CAGTCGAACC ATACGACGCT AAAGTACTGA TGCCAGCAGG AAGTGCCGTA CCATGGCCAG 1920 

AATTCTTAGC ACTGAGTGAG AGCGCCACGC TTGTGTACAA CGAAAGAGAG TTTGTGAACC 1980 

GCAAGCTGTA CCATATTGCC ATGCACGGTC CCGCTAAGAA TACAGAAGAG GAGCAGTACA 2040 

AGGTTACAAA GGCAGAGCTC GCAGAAACAG AGTACGTGTT TGACGTGGAC AAGAAGCGAT 2100 

GCGTTAAGAA GGAAGAAGCC TCAGGACTTG TCCTTTCGGG AGAACTGACC AACCCGCCCT 2160 

ATCACGAACT AGCTCTTGAG GGACTGAAGA CTCGACCCGC GGTCCCGTAC AAGGTTGAAA 2220 






CAATAGGAGT GATAGGCACA CCAGGATCGG GCAAGTCAGC TATCATCAAG TCAACTGTCA 2280

CGGCACGTGA TCTTGTTACC AGCGGAAAGA AAGAAAACTG CCGCGAAATT GAGGCCGACG 2340

TGCTACGGCT GAGGGGCATG CAGATCACGT CGAAGACAGT GGATTCGGTT ATGCTCAACG 2400

GATGCCACAA AGCCGTAGAA GTGCTGTATG TTGACGAAGC GTTCCGGTGC CACGCAGGAG 2460

CACTACTTGC CTTGATTGCA ATCGTCAGAC CCCGTAAGAA GGTAGTACTA TGCGGAGACC 2520

CTAAGCAATG CGGATTCTTC AACATGATGC AACTAAAGGT ACATTTCAAC CACCCTGAAA 2580

AAGACATATG TACCAAGACA TTCTACAAGT TTATCTCCCG ACGTTGCACA CAGCCAGTCA 2640

CGGCTATTGT ATCGACACTG CATTACGATG GAAAAATGAA AACCACAAAC CCGTGCAAGA 2700

AGAACATCGA AATCGACATT ACAGGGGCCA CGAAGCCGAA GCCAGGGGAC ATCATCCTGA 2760

CATGTTTCCG CGGGTGGGTT AAGCAACTGC AAATCGACTA TCCCGGACAT GAGGTAATGA 2820

CAGCCGCGGC CTCACAAGGG CTAACCAGAA AAGGAGTATA TGCCGTCCGG CAAAAAGTCA 2880

ATGAAAACCC GCTGTACGCG ATCACATCAG AGCATGTGAA CGTGTTGCTC AGCCGCACTG 2940

AGGACAGGCT AGTATGGAAA ACTTTACAGG GCGACCCATG GATTAAGCAG CTCACTAACG 3000

TACCTAAAGG AAATTTTCAG GCCACCATCG AGGACTGGGA AGCTGAACAC AAGGGAATAA 3060

TTGCTGCGAT AAACAGTCCC GCTCCCCGTA CCAATCCGTT CAGCTGCAAG ACTAACGTTT 3120

GCTGGGCGAA AGCACTGGAA CCGATACTGG CCACGGCCGG TATCGTACTT ACCGGTTGCC 3180

AGTGGAGCGA GCTGTTCCCA CAGTTTGCGG ATGACAAACC ACACTCGGCC ATCTACGCCT 3240

TAGACGTAAT TTGCATTAAG TTTTTCGGCA TGGACTTGAC AAGCGGGCTG TTTTCCAAAC 3300

AGAGCATCCC GTTAACGTAC CATCCTGCCG ACTCAGCGAG GCCAGTAGCT CATTGGGACA 3360

ACAGCCCAGG AACACGCAAG TATGGGTACG ATCACGCCGT TGCCGCCGAA CTCTCCCGTA 3420

GATTTCCGGT GTTCCAGCTA GCTGGGAAAG GCACACAGCT TGATTTGCAG ACGGGCAGAA 3480

CTAGAGTTAT CTCTGCACAG CATAACTTGG TCCCAGTGAA CCGCAATCTC CCTCACGCCT 3540

TAGTCCCCGA GCACAAGGAG AAACAACCCG GCCCGGTCGA AAAATTCTTG AGCCAGTTCA 3600

AACACCACTC CGTACTTGTG ATCTCAGAGA AAAAAATTGA AGCTCCCCAC AAGAGAATCG 3660

AATGGATCGC CCCGATTGGC ATAGCCGGCG CAGATAAGAA CTACAACCTG GCTTTCGGGT 3720

TTCCGCCGCA GGCACGGTAC GACCTGGTGT TCATCAATAT TGGAACTAAA TACAGAAACC 3780

ATCAGTTCA ACAGTGCGAA GACCACGCGG CGACCTTGAA AACCCTTTCG CGTTCGGCCC 3840

TGAACTGCCT TAACCCCGGA GGCACCCTCG TGGTGAAGTC CTACGGTTAC GCCGACCGCA 3900

ATAGTGAGGA CGTAGTCACC GCTCTTGCCA GAAAATTTGT CAGAGTGTCT GCAGCGAGGC 3960

CAGAGTGCGT CTCAAGCAAT ACAGAAATGT ACCTGATTTT CCGACAACTA GACAACAGCC 4020

GCACACGACA ATTCACCCCG CATCATTTGA ATTGTGTGAT TTCGTCCGTG TACGAGGGTA 4080

CAAGAGACGG AGTTGGAGCC GCACCGTCGT ACCGTACTAA AAGGGAGAAC ATTGCTGA7T 4140

GTCAAGAGGA AGCAGTTGTC AATGGAGCCA ATCCACTGGG CAGACCAGGA GAAGGAGTCT 4200

GCCGTGCCAT CTATAAACGT TGGCCGAACA GTTTCACCGA TTCAGCCACA GAGACAGGTA 4260

CCGCAAAACT GACTGTGTGC CAAGGAAAGA AAGTGATCCA CGCGGTTGGC CCTGATTTCC 4320

GGAAACACCC AGAGGCAGAA GCCCTGAAAT TGCTGCAAAA CGCCTACCAT GCAGTGGCAG 4380 

a pttapta a a tca apataat ATrAARTfTG TCGCCATCCC ACTGCTATCT ACAGGCATTT 4440 

ACGCAGCCGG AAAAGACCGC CTTGAGGTAT CACTTAACTG CTTGACAACC GCGCTAGACA 4500 

GAACTGATGC GGACGTAACC ATCTACTGCC TGGATAAGAA GTGGAAGGAA AGAATCGACG 4560 

CGGTGCTCCA ACTTAAGGAG TCTGTAACTG AGCTGAAGGA TGAGGATATG GAGATCGACG 4620 

ACGAGTTAGT ATGGATCCAT CCGGACAGTT GCCTGAAGGG AAGAAAGGGA TTCAGTACTA 4680 

CAAAAGGAAA GTTGTATTCG TACTTTGAAG GCACCAAATT CCATCAAGCA GCAAAAGATA 4740 

TGGCGGAGAT AAAGGTCCTG TTCCCAAATG ACCAGGAAAG CAACGAACAA CTGTGTGCCT 4800 

ACATATTGGG GGAGACCATG GAAGCAATCC GCGAAAAATG CCCGGTCGAC CACAACCCGT 4860 

CGTCTAGCCC GCCAAAAACG CTGCCGTGCC TCTGTATGTA TGCCATGACG CCAGAAAGGG 4920 

TCCACAGACT CAGAAGCAAT AACGTCAAAG AAGTTACAGT ATGCTCCTCC ACCCCCCTTC 4980 

CAAAGTACAA AATCAAGAAT GTTCAGAAGG TTCAGTGCAC AAAAGTAGTC CTGTTTAACC 5040 

CGCATACCCC CGCATTCGTT CCCGCCCGTA AGTACATAGA AGCACCAGAA CAGCCTGCAG 5100 

CTCCGCaGC ACAGGCCGAG GAGGCCCCCG GAGTTGTAGC GACACCAACA CCACCTGCAG 5160 

CTGATAACAC CTCGCTTGAT GTCACGGACA TCTCACTGGA CATGGAAGAC AGTAGCGAAG 5220 

GCTCACTCTT TTCGAGCTTT AGCGGATCGG ACAACTACCG AAGGCAGGTG GTGGTGGCTG 5280 

ACGTCCATGC CGTCCAAGAG CCTGCCCCTG TTCCACCGCC AAGGCTAAAG AAGATGGCCC 5340 

GCCTGGCAGC GGCAAGAATG CAGGAAGAGC CAACTCCACC GGCAAGCACC AGCTCTGCGG 5400

ACGAGTCCCT TCACCTTTCT TTTGATGGGG TATCTATATC CTTCGGATCC CTTTTCGACG 5460 

GAGAGATGGC CCGCTTGGCA GCGGCACAAC CCCCGGCAAG TACATGCCCT ACGGATGTGC 5520

CTATGTCTTT CGGATCGTTT TCCGACGGAG AGATTGAGGA GTTGAGCCGC AGAGTAACCG 5580 

AGTCGGAGCC CGTCCTGTTT GGGTCATTTG AACCGGGCGA AGTGAACTCA ATTATATCGT 5640 

CCCGATCAGC CGTATCTTTT CCACCACGCA AGCAGAGACG TAGACGCAGG AGCAGGAGGA 5700 

CCGAATACTG TCTAACCGGG GTAGGTGGGT ACATATTTTC GACGGACACA GGCCCTGGGC 5760 

ACTTGCAAAA GAAGTCCGTT CTGCAGAACC AGCTTACAGA ACCGACCTTG GAGCGCAATG 5820 

TTCTGGAAAG AATCTACGCC CCGGTGCTCG ACACGTCGAA AGAGGAACAG CTCAAACTCA 5880 

GGTACCAGAT GATGCCCACC GAAGCCAACA AAAGCAGGTA CCAGTCTCGA AAAGTAGAAA 5940 

ACCAGAAAGC CATAACCACT GAGCGACTGC TTTCAGGGCT ACGACTGTAT AACTCTGCCA 6000 

CAGATCAGCC AGAATGCTAT AAGATCACCT ACCCGAAACC ATCGTATTCC AGCAGTGTAC 6060 

CAGCGAACTA CTCTGACCCA AAGTTTGCTG TAGCTGTTTG TAACAACTAT CTGCATGAGA 6120 

ATTACCCGAC GGTAGCATCT TATCAGATCA CCGACGAGTA CGATGCTTAC TTGGATATGG 6180 

TAGACGGGAC AGTCGCTTGC CTAGATACTG CAACTTTTTG CCCCGCCAAG CTTAGAAGTT 6240 

ACCCGAAAAG ACACGAGTAT AGAGCCCCAA ACATCCGCAG TGCGGTTCCA TCAGCGATGC 6300 
AGAACACGTT GCAAAACGTG CTCATTGCCG CGACTAAAAG AAACTGCAAC GTCACACAAA 6360

TGCGTGAACT GCCAACACTG GACTCAGCGA CATTCAACGT TGAATGCTTT CGAAAATATG 6420 

CATGCAATGA CGAGTATTGG GAGGAGTTTG CCCGAAAGCC AATTAGGATC ACTACTGAGT 6480 

TCGTTACCGC ATACGTGGCC AGACTGAAAG GCCCTAAGGC CGCCGCACTG TTCGCAAAGA 6540 

CGCATAATTT GGTCCCATTG CAAGAAGTGC CTATGGATAG ATTCGTCATG GACATGAAAA 6600 

GAGACGTGAA AGTTACACCT GGCACGAAAC ACACAGAAGA AAGACCGAAA GTACAAGTGA 6660 

TACAAGCCGC AGAACCCCTG GCGACCGCTT ACCTATGCGG GATCCACCGG GAGTTAGTGC 6720 

GCAGGCTTAC AGCCGTTTTG CTACCCAACA TTCACACGCT CTTTGACATG TCGGCGGAGG 6780

ACTTTGATGC AATCATAGCA GAACACTTCA AGCAAGGTGA CCCGGTACTG GAGACGGATA 6840

TCGCCTCGTT CGACAAAAGC CAAGACGACG CTATGGCGTT AACCGGCCTG ATGATCTTGG 6900 

AAGACCTGGG TGTGGACCAA CCACTACTCG ACTTGATCGA GTGCGCCTTT GGAGAAATAT 6960 

CATCCACCCA TCTGCCCACG GGTACCCGTT TCAAATTCGG GGCGATGATG AAATCCGGAA 7020 

TGTTCCTCAC GCTCTTTGTC AACACAGTTC TGAATGTCGT TATCGCCAGC AGAGTATTGG 7080 

AGGAGCGGCT TAAAACGTCC AAATGTGCAG CATTTATCGG CGACGACAAC ATTATACACG 7140 

GAGTAGTATC TGACAAAGAA ATGGCTGAGA GGTGTGCCAC CTGGCTCAAC ATGGAGGTTA 7200 

AGATCATTGA CGCAGTCATC GGCGAGAGAC CACCTTACTT CTGCGGTGGA TTCATCTTGC 7260 

AAGATTCGGT TACCTCCACA GCGTGTCGCG TGGCGGACCC CTTGAAAAGG CTGTTTAAGT 7320 

TGGGTAAACC GCTCCCAGCC GACGATGAGC AAGACGAAGA CAGAAGACGC GCTCTGCTAG 7380 

ATGAAACAAA GGCGTGGTTT AGAGTAGGTA TAACAGACAC CTTAGCAGTG GCCGTGGCAA 7440 

CTCGGTATGA GGTAGACAAC ATCACACCTG TCCTGCTGGC ATTGAGAACT TTTGCCCAGA 7500

GCAAAAGAGC ATTTCAAGCC ATCAGAGGGG AAATAAAGCA TCTCTACGGT GGTCCTAAAT 7560

AGTCAGCATA GTACATTTCA TCTGACTAAT ACCACAACAC CACCACCATG AATAGAGGAT 7620 

TCTTTAACAT GCTCGGCCGC CGCCCCTTCC CAGCCCCCAC TGCCATGTGG AGGCCGCGGA 7680

GAAGGAGGCA GGCGGCCCCG ATGCCTGCCC GCAATGGGCT GGCTTCCCAA ATCCAGCAAC 7740 

TGACCACAGC CGTCAGTGCC CTAGTCATTG GACAGGCAAC TAGACCTCAA ACCCCACGCC 7800 

CACGCCCGCC GCCGCGCCAG AAGAAGCAGG CGCCAAAGCA ACCACCGAAG CCGAAGAAAC 7860 

CAAAAACACA GGAGAAGAAG AAGAAGCAAC CTGCAAAACC CAAACCCGGA AAGAGACAGC 7920 

GTATGGCACT TAAGTTGGAG GCCGACAGAC TGTTCGACGT CAAAAATGAG GACGGAGATG 7980 

TCATCGGGCA CGCACTGGCC ATGGAAGGAA AGGTAATGAA ACCACTCCAC GTGAAAGGAA 8040 

CTATTGACCA CCCTGTGCTA TCAAAGCTCA AATTCACCAA GTCGTCAGCA TACGACATGG 8100 

AGTTCGCACA GTTGCCGGTC AACATGAGAA GTGAGGCGTT CACCTACACC AGTGAACACC 8160 

CTGAAGGGTT CTACAACTGG CACCACGGAG CGGTGCAGTA TAGTGGAGGC AGATTTACCA 8220 

TCCCCCGCGG AGTAGGAGGC AGAGGAGACA GTGGTCGTCC GATTATGGAT AACTCAGGCC 8280 

GGGTTGTCGC GATAGTCCTC GGAGGGGCTG ATGAGGGAAC AAGAACCGCC CTTTCGGTCG 8340 
TCACCTGGAA TAGCAAAGGG AAGACAATCA AGACAACCCC GGAAGGGACA GAAGAGTGGT 8400

CTGCTGCACC ACTGGTCACG GCCATGTGCT TGCTTGGAAA CGTGAGCTTC CCATGCAATC 8460 

GCCCGCCCAC ATGCTACACC CGCGAACCAT CCAGAGCTCT CGACATCCTC GAAGAGAACG 8520 

TGAACCACGA GGCCTACGAC ACCCTGCTCA ACGCCATATT GCGGTGCGGA TCGTCCGGCA 8580 

GAAGTAAAAG AAGCGTCACT GACGACTTTA CCTTGACCAG CCCGTACTTG GGCACATGCT 8640 

CGTACTGTCA CCATACTGAA CCGTGCTTTA GCCCGATTAA GATCGAGCAG GTCTGGGATG 8700 

AAGCGGACGA CAACACCATA CGCATACAGA CTTCCGCCCA GTTTGGATAC GACCAAAGCG 8760 

GAGCAGCAAG CTCAAATAAG TACCGCTACA TGTCGCTCGA GCAGGATCAT ACTGTCAAAG 8820 

AAGGCACCAT GGATGACATC AAGATCAGCA CCTCAGGACC GTGTAGAAGG CTTAGCTACA 8880 

AAGGATACTT TCTCCTCGCG AAGTGTCCTC CAGGGGACAG CGTAACGGTT AGCATAGCGA 8940 

GTAGCAACTC AGCAACGTCA TGCACAATGG CCCGCAAGAT AAAACCAAAA TTCGTGGGAC 9000 

GGGAAAAATA TGACCTACCT CCCGTTCACG GTAAGAAGAT TCCTTGCACA GTGTACGACC 9060 

GTCTGAAAGA AACAACCGCC GGCTACATCA CTATGCACAG GCCGGGACCG CATGCCTATA 9120 

CATCCTATCT GGAGGAATCA TCAGGGAAAG TTTACGCGAA GCCACCATCC GGGAAGAACA 9180

TTACGTACGA GTGCAAGTGC GGCGATTACA AGACCGGAAC CGTTACGACC CGTACCGAAA 9240 

TCACGGGCTG CACCGCCATC AAGCAGTGCG TCGCCTATAA GAGCGACCAA ACGAAGTGGG 9300

TCTTCAACTC GCCGGACTCG ATCAGACACG CCGACCACAC GGCCCAAGGG AAATTGCATT 9360 

TGCCTTTCAA GCTGATCCCG AGTACCTGCA TGGTCCCTGT TGCCCACGCG CCGAACGTAG 9420

TACACGGCTT TAAACACATC AGCCTCCAAT TAGACACAGA CCATCTGACA TTGCTCACCA 9480 

CCAGGAGACT AGGGGCAAAC CCGGAACCAA CCACTGAATG GATCATCGGA AACACGGTTA 9540 

GAAACTTCAC CGTCGACCGA GATGGCCTGG AATACATATG GGGCAATCAC GAACCAGTAA 9600 

GGGTCTATGC CCAAGAGTCT GCACCAGGAG ACCCTCACGG ATGGCCACAC GAAATAGTAC 9660 

AGCATTACTA TCATCGCCAT CCTGTGTACA CCATCTTAGC CGTCGCATCA GCTGCTGTGG 9720 

CGATGATGAT TGGCGTAACT GTTGCAGCAT TATGTGCCTG TAAAGCGCGC CGTGAGTGCC 9780

TGACGCCATA TGCCCTGGCC CCAAATGCCG TGATTCCAAC TTCGCTGGCA CTTTTGTGCT 9840 

GTGTTAGGTC GGCTAATGCT GAAACATTCA CCGAGACCAT GAGTTACTTA TGGTCGAACA 9900

GCCAGCCGTT CTTCTGGGTC CAGCTGTGTA TACCTCTGGC CGCTGTCGTC GTTCTAATGC 9960 

GCTGTTGCTC ATGCTGCCTG CCTTTTTTAG TGGTTGCCGG CGCCTACCTG GCGAAGGTAG 10020 

ACGCCTACGA ACATGCGACC ACTGTTCCAA ATGTGCCACA GATACCGTAT AAGGCACTTG 10080 

TTGAAAGGGC AGGGTACGCC CCGCTCAATT TGGAGATTAC TGTCATGTCC TCGGAGGTTT 10140 

TGCCTTCCAC CAACCAAGAG TACATTACCT GCAAATTCAC CACTGTGGTC CCCTCCCCTA 10200 

AAGTCAGATG CTGCGGCTCC TTGGAATGTC AGCCCGCCGC TCACGCAGAC TATACCTGCA 10260 

AGGTCTTTGG AGGGGTGTAC CCCTTCATGT GGGGAGGAGC ACAATGTTTT TGCGACAGTG 10320 
AGAACAGCCA GATGAGTGAG GCGTACGTCG AATTGTCAGT AGATTGCGCG ACTGACCACG 10380 




CGCAGGCGAT TAAGGTGCAT ACTGCCGCGA TGAAAGTAGG ACTGCGTATA GTGTACGGGA 10440 

ACACTACCAG TTTCCTAGAT GTGTACGTGA ACGGAGTCAC ACCAGGAACG TCTAAAGACC 10500 

TGAAAGTCAT AGCTGGACCA ATTTCAGCAT TGTTTACACC ATTCGATCAC AAGGTCGTTA 10560 

TCAATCGCGG CCTGGTGTAC AACTATGACT TTCCGGAATA CGGAGCGATG AAACCAGGAG 10620 

CGTTTGGAGA CATTCAAGCT ACCTCCTTGA CTAGCAAAGA CCTCATCGCC AGCACAGACA 10680 

TTAGGCTACT CAAGCCTTCC GCCAAGAACG TGCATGTCCC GTACACGCAG GCCGCATCTG 10740 

GATTCGAGAT GTGGAAAAAC AACTCAGGCC GCCCACTGCA GGAAACCGCC CCTTTTGGGT 10800 

GCAAGATTGC AGTCAATCCG CTTCGAGCGG TGGACTGCTC AJ<A'CGGGMC ATTCCCATiT 10860 

CTATTGACAT CCCGAACGCT GCCTTTATCA GGACATCAGA TGCACCACTG GTCTCAACAG 10920 

TCAAATGTGA TGTCAGTGAG TGCACTTATT CAGCGGACTT CGGAGGGATG GCTACCCTGC 10980 

AGTATGTATC CGACCGCGAA GGACAATGCC CTGTACATTC GCATTCGAGC ACAGCAACCC 11040 

TCCAAGAGTC GACAGTTCAT GTCCTGGAGA AAGGAGCGGT GACAGTACAC TTCAGCACCG 11100 

CGAGCCCACA GGCGAACTTC ATTGTATCGC TGTGTGGTAA GAAGACAACA TGCAATGCAG 11160 

AATGCAAACC ACCAGCTGAT CATATCGTGA GCACCCCGCA CAAAAATGAC CAAGAATTCC 11220 

AAGCCGCCAT CTCAAAAACT TCATGGAGTT GGCTGTTTGC CCTTTTCGGC GGCGCCTCGT 11280 

CGCTATTAAT TATAGGACTT ATGATTTTTG CTTGCAGCAT GATGCTGACT AGCACACGAA 11340 

GATGACCGCT ACGCCCCAAT GACCCGACCA GCAAAACTCG ATGTACTTCC GAGGAACTGA 11400 

TGTGCATAAT GCATCAGGCT GGTATATTAG ATCCCCGCTT ACCGCGGGCA ATATAGCAAC 11460 

ACCAAAACTC GACGTATTTC CGAGGAAGCG CAGTGCATAA TGCTGCGCAG TGTTGCCAAA 11520 

TAATCACTAT ATTAACCATT TATTCAGCGG ACGCCAAAAC TCAATGTATT TCTGAGGAAG 11580 

CATGGTGCAT AATGCCATGC AGCGTCTGCA TAACTTTTTA TTATTTCTTT TATTAATCAA 11640 

CAAAATTTTG TTTTTAACAT TTC 11663 
That Which Is Claimed Is:

1. A recombinant DNA comprising a cDNA coding for an
infectious South African Arbovirus No. 86 (S.A.AR86) virus RNA transcript and
a heterologous promoter positioned upstream from said cDNA and operatively
associated therewith.

2. The recombinant DNA according to Claim 1, wherein said
cDNA is selected from the group consisting of (i) cDNA having the sequence
given herein as SEQ ID NO.: 1, (ii) cDNA having the same protein coding
sequence as the cDNA given herein as SEQ ID NO.: 1, and (iii) cDNA according
to (i) or (ii) above, and further comprising at least one attenuating mutation in said
cDNA.

3. The recombinant DNA according to Claim 1, wherein said 
cDNA has the sequence given herein as SEQ ID NO.: 1. 

4. The recombinant DNA according to Claim 1, further
comprising at least one attenuating mutation in said cDNA clone.

5. The recombinant DNA according to Claim 1, further 
comprising at least two attenuating mutations in said cDNA clone. 

6. The recombinant DNA according to Claim 5, wherein each
of said attenuating mutations are in the region selected from the group consisting
of the nsP1 coding region, E2 coding region, and nsP2 coding region.

7. The recombinant DNA according to Claim 1, further
comprising at least one attenuating mutation selected from the group consisting of
codons at nsP1 amino acid position 538 which specify an attenuating amino acid,
codons at E2 amino acid position 304 which specify an attenuating amino acid,
codons at E2 amino acid position 314 which specify an attenuating amino acid,
codons at E2 amino acid position 372 which specify an attenuating amino acid,
codons at E2 amino acid position 376 which specify an attenuating amino acid,
codons at nsP2 amino acid position 96 which specify an attenuating amino acid,
codons at nsP2 amino acid position 372 which specify an attenuating amino acid,
codons at nsP2 amino acid position 529 which specify an attenuating amino acid,
codons at nsP2 amino acid position 571 which specify an attenuating amino acid,
codons at nsP2 amino acid position 682 which specify an attenuating amino acid,
codons at nsP2 amino acid position 804 which specify an attenuating amino acid,
and codons at nsP3 amino acid position 22 which specify an attenuating amino
acid.

8. The recombinant DNA according to Claim 7, wherein said
attenuating mutation comprises a substitution mutation.

9. The recombinant DNA according to Claim 8, wherein said
substitution mutation codes for isoleucine at nsP1 amino acid 538.

10. The recombinant DNA according to Claim 8, wherein said
substitution mutation codes for threonine at E2 amino acid 304.

11. The recombinant DNA according to Claim 8, wherein said 
substitution mutation codes for lysine at E2 amino acid 314. 

12. The recombinant DNA according to Claim 8, wherein said 
substitution mutation codes for valine at E2 amino acid 372. 

13. The recombinant DNA according to Claim 8, wherein said
substitution mutation codes for alanine at E2 amino acid 376.

14. The recombinant DNA according to Claim 8, wherein said 
substitution mutation codes for glycine at nsP2 amino acid 96. 
15. The recombinant DNA according to Claim 8, wherein said
substitution mutation codes for valine at nsP2 amino acid 372. 

16. The recombinant DNA according to Claim 8, wherein said 
substitution mutation codes for leucine at nsP2 amino acid 529. 

17. The recombinant DNA according to Claim 8, wherein said
substitution mutation codes for asparagine at nsP2 amino acid 571.

18. The recombinant DNA according to Claim 8, wherein said 
substitution mutation codes for arginine at nsP2 amino acid 682. 

19. The recombinant DNA according to Claim 8, wherein said
substitution mutation codes for arginine at nsP2 amino acid 804.

20. The recombinant DNA according to Claim 8, wherein said 
substitution mutation codes for arginine at nsP3 amino acid 22. 

21. The recombinant DNA according to Claim 1 further 
comprising at least one silent mutation. 

22. The recombinant DNA according to Claim 21, wherein said
silent mutation is located at a position selected from the group consisting of
nucleotide 215, nucleotide 3863, nucleotide 5984 and nucleotide 9113.

23. The recombinant DNA according to Claim 1, wherein not
more than eight nucleotides are positioned between said promoter and said cDNA
clone.

24. The recombinant DNA according to Claim 23, wherein said
promoter is selected from the group consisting of T3 promoters, T7 promoters, 
and SP6 promoters. 
25. The recombinant DNA according to Claim 1, wherein said
recombinant DNA comprises a plasmid, and wherein said recombinant DNA 
further comprises a unique restriction site positioned downstream from said cDNA 
clone. 

26. An infectious RNA transcript encoded by a cDNA according
to Claim 1.

27. Infectious attenuated viral particles containing an RNA 
transcript of Claim 26. 






28. A pharmaceutical composition comprising an effective 
immunogenic amount of an infectious attenuated virus according to Claim 27 in 
combination with a pharmaceutically acceptable carrier.